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NEC ACOS System 3900 


Outline of the Super-Large-Scale Mainframe 
ACOS System 3900 


94FE0465A Tokyo NEC TECHNICAL JOURNAL 
in Japanese Feb 94 pp 1-4 


[Article by Mikio Kuwata, Computers Division; T. Kondo, 
Computers Division; M. Akagi, Computers Division; A. 
Date, Basic Software Development Division; and K. Ike- 
hara, Basic Software Development Division] 


[Text] To cope with market needs for large-scale com- 
puter systems in the 1990s, NEC has developed a super- 
large-scale mainframe ACOS System 3900 making use of 
the latest technologies. 


The ACOS System 3900 has been achieved based on 
ACOS-6 architecture and is one of the world’s largest 
and most powerful systems. It has many advantages 
worthy of a super-large-scale system, such as large-scale 
data processing capability, high-speed data processing, 
high reliability, and high connectivity to open systems. 


1. Introduction 


The sizes of computers are decreasing as downsizing 
continues in the 1990s. Simultaneously, the need for 
super-large-scale mainframes is growing. In today’s infor- 
mation society, the super-large-scale mainframe’s features 
of a large scale, high reliability, and high performance are 
requirements. Information systems built around worksta- 
tions still cannot satisfy the market’s demand for these 
features. Meanwhile, open systems are soundly pro- 
gressing. As hardware like workstations and personal com- 
puters supporting standard operating systems (OSs) 
expands, new functions will be needed to design for the 
specialization and co-existence of these systems with a 
general-purpose computer having a proprietary OS. 


In this backdrop, the ACOS System 3900 is a super- 
large-scale system architecture developed and imple- 
mented as the successor to the older ACOS System 2000 
model. In addition, the new ACOS-6/NVX operating 
system was developed to draw out the hardware’s power. 


In this paper, we introduce the development objectives 
and hardware features of ACOS System 3900, and 
present a complete overview of ACOS-6/NVX. 


2. Development Objectives 


In today’s advanced information society, super-large-scale 
mainframes feature a huge amount of information storage, 
ultra-high-speed data processing, and very high reliability, 
so they form the core of information systems. In the future, 
in line with developments in downsizing and open sys- 
tems, the major trend in systems will be a variety of 
different types of systems connected through networks. 
Within this trend, the super-large-scale mainframe system 


will play the role of server for large-scale database pro- 
cessing, system management, and large numbers of work- 
stations and personal computers, as well as be used as the 
core of the network. 


Given this background of demands for this type of 
super-large-scale mainframe system, ACOS System 3900 
was developed based on the following basic objectives. 


(1) High-level performance and larger scales 


The quantity of information being processed and the 
processing performed by computers are increasing annu- 
ally. This increase cannot simply be thought of as an 
extension of current conditions. The reason being, this is 
not merely a quantitative increase in the amount of data 
handled, but reflects changes underway in the uses of 
computer systems. For example, previous mainframe 
computer systems were primarily used in business man- 
agement such as sales management and salary calcula- 
tions in corporations. While today, they collect and 
analyze information for business strategies. The manner 
of using these computers changes minute by minute. And 
in the future, the need will increase for high-speed 
processing of greater quantities of information. 


(2) Networking and open systems 


Today, almost all systems, from super-large-scale main- 
frames to workstations and even personal computers, are 
connected to networks. The trend has become systems 
on networks cooperate with one another. Therefore, 
interoperability between workstations and personal 
computers on the network will become increasingly 
important. 


(3) High reliability and no downtime 


The most important element in realizing functions as the 
core in information systems is said to be reliability. This 
is no exaggeration. As society becomes more advanced 
and complex, the requirement for reliability in systems 
becomes vital. In the future, to realize the functions in 
the core of highly reliable systems and networks, high 
reliability is demanded to enable continuous operation 
for 365 days, 24 hours a day for super-large-scale main- 
frames. 


The ACOS System 3900 was born as the next-generation 
system satisfying the conditions required in future super- 
large-scale mainframe systems while inheriting the mas- 
sive ACOS-6 software assets. 


3. An Overview and Features of the Hardware 


ACOS System 3900 is one of the fastest super-large-scale 
general-purpose computers in the world having twice the 
performance and 512 times the virtual addressing capa- 
bility than the earlier ACOS System 2000. 


Table | lists the system specifications of ACOS System 
3900. The configurations provided range from the single 
processor Model 10 to the eight processor Model 80 used 
in the largest configuration of ACOS System 3900. 
Upgrading to high-end models is possible in the field. In 
this section, we will introduce the major hardware fea- 
tures divided into four elements. 
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Table 1 Specifications for the ACOS System 3900 

Model 10 20 40 50 60 70 80 
' Number of execution l 2 2 4 5 6 7 » 

processing units (EPU) 

Number of system con- l I 2 2 2 2 2 2 

trol units (SCU) 

Cache Instruc- 64 64x2 64x4 64x5 64x6 64x7 64x8 

memory tions 

capacity 

(kilobytes) | Operands 64 64x2 64x4 64x5 64x6 64x7 64x8 

Main memory capacity 512 1,024 ($12x2) 

(megabytes) 

Extended main 64 128 (64x2) 

memory capacity 

(megabytes) . 

Expanded memory 8 (256 megabytes expansion unit) 

capacity (gigabytes) 

Built-in array processor Optional equipment 

Maximum number of 256 $12 

channels 

Channel transfer speed , 3/4.5/6/9 (optical channels) 

(megabytes/second) 





Overall transfer ability | 
(gigabytes/second) 








2 





3.1 Technology 


To realize a super-large-scale, super-high-speed main- 
frame, technology to densely package very high-speed 
and highly integrated LSI becomes a requirement. Very 
high-speed LSI, high-density packaging technology, 
cooling technology, and power source technology uti- 
lized in the SX-3 supercomputer are adopted In ACOS 
System 3900; thus, higher performance, higher reli- 
ability, and smaller sizes are achieved. 


These technology features are presented below. 
(1) Very high-speed LSI technology 

Table 2 lists the specifications of the LSI used. 
(2) Ultra high-density packaging technology 


¢ 1) Uses an ultra-compact LSI chip carrier (FTC) for 
high-density connectivity and high efficiency cooling 

¢ 2) Uses a high-density multilayered ceramic substrate 
with dimensions of 22.5-cm?, 25-4m wire width, 
75-um wire pitch, and an LSI high-density package 
(MCP) capable of holding a maximum of 100 LSIs 


(3) Cooling technology 
1) Adopts liquid cooling to obtain stable cooling 


2) Uses a liquid cooling unit (CLU) with dual air-cooling 
and water-cooling for the user’s cooling equipment 


(4) Power source technology 


1) Uses water-cooled power module (DC-DC converter) 
at low voltages (2-5 V) and large capacity (3 kW) 


2) Power source control from remote areas using optical 
mode channel 





Table 2. Specifications for the LSI chips 

















Logic LSI LSI with RAM | LSI with RAM 

Number of 20K gates 5K gates 7K gates 
gates 
RAM integra- 64 kb 40 kb 
tion degree 
Gate delay 70 picosec- 70 picosec- 70 picosec- 

onds onds onds 
RAM access 1.6 nanosec- 1.6 nanosec- 
time onds onds 














3.2 Extending the architecture 


To implement a system capable of processing even 
greater quantities of information, the architecture 
forming the basis of the system must be extended. In 
ACOS System 3900, the ACOS eXtended System Archi- 
tecture (XSA) was adopted to more rapidly process 
greater quantities of information. 


Below, we present an overview of the main extensions in 
XSA. 


(1) Extending virtual memory space 


A virtual memory space of 4 PB (1 petabyte = 10'° 
bytes), which is 512 times that in the older ACOS-6 
architecture, can be used in XSA. 


(2) Extending real memory space 
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A real memory space up to 512 GB (1 gigabyte = 10° 
bytes) can be used in XSA. (In contrast, up to one 
gigabyte is implemented in the hardware of the ACOS 
System 3900.) 


(3) Extending the (number of processors)/(number of 
channels) 


A maximum of eight execution processing units and I/O 
processors can be interconnected through a maximum of 
512 I/O channels. 


(4) Supports for the expanded memory unit 


The expanded memory units are memory units installed 
between peripheral memory like main memory and 
magnetic disk units and have a maximum capacity of 8 
GB per system. Data transfer instructions between 
expanded memory and main memory are supported and 
a fast transfer speed at the maximum of 400 MB/s is 
possible. 


3.3 High-speed technologies 


The high-speed pipelining used in ACOS System 2000 
was adopted and augmented for use in the execution 
processing unit of ACOS System 3900. In pipelining, one 
instruction is divided into multiple execution stages. 
Each stage provides a dedicated circuit to process inde- 
pendently. Stages for different serial instructions are 
processed concurrently. 


Generally, in pipelining, the efficiency of pipeline pro- 
cessing is diminished by variations in the instruction 
sequence due to branch instructions and interference 
between instructions where a previous instruction over- 
writes the operands of succeeding instructions. These are 
the sources of lower performance. To avoid this decrease 
in pipeline processing efficiency in ACOS System 3900, 
branch prediction, buffers between stages, and bypass 
mechanisms for each type of data are extended. Also by 
partitioning the control memorv based on specific 
instruction types and constructing a hierarchical buffer 
in the address translation buffer, increased indepen- 
dence of the circuits of each function and faster clocks 
are designed. 


In the I/O processors for ACOS System 3900, a hierar- 
chical structure is adopted for channel management and 
channel control. Therefore, processing corresponding to 
each function is distributed to each unit and highly 
efficient I/O can be performed. 


3.4 Reliability technologies 


To implement a highly reliable system, the needed 
advanced technologies range from the device level to the 
system level. These technologies, cultivated over many 
years. are being further developed in ACOS System 
3900. Reliability, availability, and serviceability are 
achieved. 


(1) Reliability 


Reliability is difficulty in failing. Reliability can be 
calculated based on the sum of the failure rate of each 
part being used. That is, by building a system from parts 
that seldom fail and from as few parts as possible, a 
highly reliable system can be realized. 


In addition to increasing the reliability of each used part 
in ACOS System 3900, by increasing the integration of 
LSI, decreasing wiring between parts, and the large-scale 
integration of the control package of the power source 
system, the number of parts is significantly decreased 
and reliability is achieved. 


(2) Availability 


Availability is the continuation of system operation to 
minimize the effect of failures when they occur. To 
achieve availability in ACOS System 3900, various func- 
tions are created in hardware and software. Below, we 
introduce the main functions provided to achieve avail- 
ability. 


1) Automatic error correction in main memory, 
expanded memory, and control memory 


2) Instruction retry processing by the processors 


3) Another processor taking over the processing from a 
failed processor in a multiprocessor system (processor 
relief) 


4) Detached processing of fixed failed circuits in main 
memory, cache, and peripheral control system path 


5) Detached processing of failed devices 
(3) Serviceability 


Serviceability is defined as being able to repair failures as 
quickly as possible and return to the normal state when 
failures occur. When failures occur in ACOS System 
3900, Built-In Diagnostics (BID) is employed to auto- 
matically display suspected failed parts and suspicion 
probabilities. Additionally, the service center is con- 
nected by public circuits and advanced remote mainte- 
nance can be performed by a technician at the service 
center. 


4. Strengthening the ACOS-6 Operating System (OS) 


In providing a system that addresses market needs and 
makes full use of hardware functions and performance, 
the role played by the OS is critical. ACOS-6/NVX is 
based on ACOS-6/MVX II and addresses recent changes 
in the use of information systems. It was developed as an 
OS to draw out the limits of the hardware performance 
of ACOS System 3900. 


Along with the large scale, advanced performance, high 
reliability, and operability for the centralized processing 
host, suitable fundamental functions are offered in 
ACOS-6/NVX in the large-scale data servers in a distnib- 
uted processing environment. 


Next, we present the major features. 
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(1) Large-scale system support by a multiprocessor 
system up to a maximum of eight processors, virtual 
memory space expansion, expansion of the number of 
disk structures 


(2) Support of high-speed data processing utilizing 
expanded memory 


(3) Reliable system support by LCMP compatibility of 
DVF (disk doubling function) and host standby function 


(4) Augmenting the operation automation and labor- 
saving functions for integrated storage management and 
cartridge library management 


(5) Substantial improvement in relational database func- 
tions (RIQSII V2) 


(6) Improving open system connectivity/cooperating 
functions in each protocol of TCP/IP (UNIX'), OSI, 
SNA” 


(7) Support of CASEWORLD, a productivity software 
development support system 


5. Conclusion 


We presented the objectives of ACOS System 3900 
development and an overview of the system. ACOS 
System 3900 inherits the software assets of ACOS-6 and 
is the newest system developed with the concept of the 
role it should play as a super-large-scale system of the 
1990s. As the core of the more advanced and complex 
information society, we hope ACOS System 3900 can 
contribute to the system development for many more 
users. 


Footnotes 


1. The UNIX Operating System is developed and 
licensed by UNIX System Laboratories, Inc. 


2. SNA is a trademark of IBM. 
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Architecture of the ACOS System 3900 


94FE0465B Tokyo NEC TECHNICAL JOURNAL 

in Japanese Feb 94 pp 5-10 

[Article by Toshiyuki Migita, Computers Division; K. 
Iwase, Computers Division; T. Kishi, Computers Divi- 
sion; M. Kimura, Computers Division; K. Fujiwara, 
Computers Division; M. Kaneda, C&C Systems Busi- 
ness Promotion Division; H. Takagi, NEC Engineering, 
Lid.; Y. Nakai, NEC Engineering, Ltd.; and T. Sawai, 
NEC Kofu, Ltd.] 


[Text] 


Abstract 


The ACOS System 3900 is the high-end model of the 
ACOS computer system. 


In this paper, we introduce features of the design of 
ACOS System 3900. In particular, an overview of the 
system configuration, system performance, and virtual 
memory system are described. 


1. Introduction 

As a pivotal system in the advanced information society, 
ACOS System 3900 ($3900) is the high-end model of the 
ACOS System developed to address user needs for faster 
speeds, better reliability, and expandability. 

The §$3900’s architecture increased the maximum 
number of processors to eight in a tightly coupled 
multiprocessor (TCMP), which previously was four pro- 
cessors in the ACOS System 2000 (S2000) model. In 
addition, the new architecture achieved an expanded 
data space of 4 PB (petabytes: peta = 10'*), a real address 
space of 512 GB (gigabytes: giga = 10°), and support for 
an expanded memory unit (EMU) positioned in a new 
memory hierarchy. 

In this paper, we describe the $3900’s design features in 
the system configuration and the virtual memory 
method. 


2. System Configuration and Features 


2.1 Features of the system configuration 
The table lists the system specifications of the $3900. 
Figure | shows an example system configuration. 





Table Specifications of the ACOS System 3900 



























































Item $3900 5 08 
Sertes 
10 20 30 40 50 60 70 80 
Number of execution 1 2 3 4 5 6 7 Ly 14 
processing units units 
Number of instruc- 364 (not including IAP), 491 (while IAP installed) 459 
tions 
Address format 0, 1, 2, 3 addressing methods 
Address modifications Direct, indirect, tally, relative, index 
Logical data (fixed length) 18, 36, 72 bits 
Binary fixed-point 18, 36, 72 bits 
Binary and hexadecimal floating-point 36, 72, 144 bits 
Data format Decimal fixed-point 1-63 digits (pack, unpack) 
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Table Specifications of the ACOS System 3900 (Continued) 
Item $3900 $2000 
Series 
Decimal floating-point 1-63 digits (8-b*t exponent) 
Character data 1 byte - | MB 
Logical data (bit string) 1 bit - 1 Mb 
Addressing methods Segmentation, paging 
Memory protection Domain protection 
128 kB 
Cache memory Instruction cache: 64 kB 
capacity 
(per execution pro- Operand cache: 64 kB 
cessing unit) 
(Block size) (128 bytes) | (64 bytes) 
Address translation 2,048 entries 
buffer capacity 
(per execution processing unit) 
Number of system I 1/2 2 1-2 
control units (SCU) 
Maximum capacity of 512 MB 512 MB/1 | GB 512 MB/I 
main memory GB GB 
Maximum capacity of 8 GB None 
expanded memory 
Number of I/O pro- 1-4 1-8 2-8 1-4 
cessors (units) 
Maximum number of 256 256/512 $12 128/512 
I/O channels (lines) 
Maximum overall 1,024 1,024/ 2,048 192/384 
data transfer capacity 2,048 
(MB/s) 
Integrated array pro- Options that can be installed Standard 
cessor (IAP) equip- 
ment 











Note: The $3900/Model 20 and $2020 are two models 
with one and two system control units. The two models 
have the maximum capacity of main memory and the 
maximum number of I/O channels (SCU 1/2: SCU - 1 
unit/SCU - 2 units). 


One feature of the $3900’s system configuration is a 
tightly coupled multiprocessor system which can be built 
from up to a maximum of eight executing processing 
units (EPU), twice the maximum in the $2000, and 
stepwise expansion of the required performance and 
functions of the system is facilitated. 


$3900 consists of the eight models of $3900/10, 20, 30, 
40, 50, 60, 70, and 80. The models contain one, two...., 
and eight EPUs, respectively. Among these systems, 
moving to a better system (field upgrade) at the installa- 
tion site is possible. Thus, a flexible response to system 
performance needs is possible. 


Moreover, the expanded memory unit (EMU) is utilized 
in the $3900. System performance can be improved by 
adding EMUs. Furthermore, a high-performance loosely 


coupled . ultiprocessor (LCMP) system combined in the 
EMU and a high-speed switching hot standby system can 
be configured. 


2.2 Equipment Overview 


The $3900’s execution processing unit (EPU) increases 
processing performance by improving the branch predic- 
tion mechanism that remedies disorder in the pipeline 
caused by the frequent arrival of branch instructions, 
and including cache memory at the maximum capacity 
(128 kB) partitioned for instruction use and operand use 
and advanced pipeline control. In addition, the optional 
integrated array processor (IAP) can reach high speeds in 
vector operations. 


The system control unit (SCU) is the unit placed at the 
center of the system comprised of an interface control 
unit that controls data transfer between EMUs, I/O 
processors (IOP), main memory units (MMU), and RAS 
controllers that control the configuration, perform diag- 
nosis, and process faults. 
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Figure 1. Configuration of the ACOS System 3900/Model 80 


Key: 1. Central Processing Unit (CPU) 2. Connected to other loosely coupled hosts 3. System Control Unit (SCU) 4. 
RAS Controller 5. CPU: Central Processing Unit 6. EPU: Execution Processing Unit 7. SCU: System Control Unit 8. 
IOP: Input/Output Processor 9. MMU: Main Memory Unit 10. EMU: Expanded Memory Unit 11. OPS: System 
Operating Station 12. SVP: Service Processor 13. URP: Unit Record Processor 14. OD: Operator Display 15. SD: 


Status Display 





The main memory units (MMU) are connected to the 
SCUs. One MMU has a maximum memory capacity of 
512 MB, so a two-SCU system can have | GB of main 
memory. 


The I/O processor (IOP) is connected to the SCU. Up to 
a maximum of eight can be connected in each system. In 
$3900’s IOP, a maximum of 512 channels per system 
(twice that in the $2000) and a maximum overall data 
transfer capability of 2 GB (5.2 times that in $2000) are 
implemented. 


The expanded memory unit (EMU) achieves a max- 
imum of 4 GB per unit and a maximum capacity of 8 GB 
per system. 


3. Architecture of the execution processing unit 


3.1 Instruction and data formats 


As shown in the table, the $3900 provides about 500 
instructions (including the optional IAP) suited to a wide 
range of applications from business to scientific calcula- 
tions. These instructions include ones to process dif- 
ferent data formats written by data descriptors. 


The $3900 can handle a variety of data formats such as 
characters, bit strings, decimal fixed- and floating-point 
data. binary fixed-point, and binary and hexadecimal 
floating-point data. In decimal codes, overlapping codes, 
separable codes, and packed and unpacked formats can 
be handled. In binary and hexadecimal floating-point 
data, high precision calculations are possible in single 
precision for a 28-bit mantissa (significant digits: 8 digits 
for decimal), in double precision for a 64-bit mantissa 
(significant digits: 19 digits for decimal), and in extended 
precision for a 124-bit mantissa (significant digits: 37 
digits for decimal). 


3.2 Virtual memory 


The ACOS System uses a virtual memory based on 
segmentation and paging. In the $2000, the maximum 
segment size a user could use was 4 GB. While in the 
$3900, the maximum segment size can be expanded to 
64 GB (about 69 billion bytes). As a result, processing 
typical large-scale array data partitioned into a number 
of segments and access as one segment are possible. This 
iS a Major contribution to simplifying the program and 
accelerating execution performance. 
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Figure 2. Relationship between virtual storage and main memory 
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Key: |. Virtual Space Register 2. Virtual Space 3. Segment Descriptor 4. Upper Limit, Attributes, WSR, Type 5. 
Segment Base 6. Segment (maximum 64 GB) 7. Operand 8. Page 0 9. Effective Address 10. Page n-2 11. 4 kB, Page 
n-1 12. Page Table 13. Page Address 14. Main Memory 15. Real Page 0 16. 4 kB 





Next, we present an overview of virtual memory in the 
$3900. 


As shown in Figure 2, the system’s address space has a 
maximum of 2°? bytes (about 4,500 trillion bytes). This 
is partitioned into 2'* (262,144) virtual spaces (working 
spaces). These are managed in the hardware by registers 
(WSRn) that hold the virtual space numbers, 0 to 
262,143. One virtual space is partitioned into multiple 
segments (an arbitrary number of segments). A segment 
is a connected region of variable length in virtual space 
and works in logical units of the information (i.e., 
procedure segment storing instructions, linkage segment 
storing link information between programs, data seg- 
ments storing data). A segment is stipulated by segment 
attributes such as the location in virtual space, size, 
access permission (i.e., readable, writable, executable). 
Conseque-itly, very fine control (invalid access prohibi- 
tion) is possible for each characteristic of stored infor- 
mation. 


Virtual space is segmented into units of fixed-length 
pages of 4 kB each unrelated to the segment. The page 
table provided in each virtual space is used and mapping 


to main memory is performed. Thus, the effective use of 
the physical resource of main memory is designed. 


Translation from the address specified in the instruction 
to the address in main memory is partitioned into two 
processes of virtual address generation and real memory 
address generation. 


(1) Virtual address generation 


The address indicated in the command language deter- 
mines the effective address (relative address from the 
segment base) based on the address description of the 
command language and address modification descrip- 
tion. Then, this address is converted into a virtual 
address based on the segment descriptor, the working 
space number indicated by WSRn, and the effective 
address. While the effective address had 34 bits (16 GB) 
in the $2000, it can be specified in up to 36 bits (64 GB) 
by the newly provided extended addressing mode 
accompanying the expansion of segments in the $3900. 


Figure 3 illustrates virtual address generation as an 
example of the operand address of basic instructions. 








This report contains information which is or may be copyrighted in a number of countries. Therefore, copying and/or 
further dissemination of the report is expressly prohibited without obtaining the permission of the copyright owner(s). 























































































































* JPRS-JST-94-019 
28 July 1994 
Ll PELALYAI(ARN) 
023 1718 27 28293035 
—tn} sy 5] @3-F an] 1| TAG] 
2457+ BFL729 (ORn) —— o ——_ 35} 
. -_ ls Crew errr eweroer= s:s Y 
0 1920 29 |32 35 - - 7 
vb | tt |wsr' | 947 
4P a = | PFLARB(1 D7 7 28RS) be 
vbkA— ail 
; , 3 ' % 19 
: 7 BRFz7 | 8  exsrmsestrKLa (RMT) 
0 + 35 
— 9 #74rbx-—2 | 
LL 
1 ome 29(wsan) O12 36 
0 17 |; l2esemaay7r2 | 
‘OR; 


0 























: - 15117! 
Sea ' 
= {aeenes | 


17; 18 51 














14|RMaSeMss! 15 wWSOAMRTKLA 


| (7 KL) 





Figure 3. Virtual-address generation 


Key: 1. Address Register 2. Operand Descriptor Register 3. Segment Upper Limit, Attributes, WSR, Type 4. Segment 
Base 5. Instruction Code 6. Address Modification (like index modification) 7. Boundary Check 8. Relative Address in 
Segment 9. Segment Base 10. (Effective Address) 11. Virtual Space Register 12. Relative Address in Virtual Space 13. 
Virtual Space Number 14. Effective Virtual Space Number 15. Relative Address in WSQ 16. (Virtual Address) 





(2) Generation of the main memory address from the 
virtual address 


A virtual address is logically divided into several fields. 
Each translation table (i.e., page table) is used to obtain 
the main memory address. Further, the address trans- 
lation buffer (TLB) stores the correspondence of the 
previously converted virtual address and the main 
memory address. This results in faster main memory 
address generation. 


Accompanying the expansion of virtual space, the size 
(connected main memory addresses) of each type of 
address translation table tends to increase. In the $3900, 
however, two modes were added mapping from a virtual 
address to a main memory address. Therefore, the 
required size for addresses that connect each type of 
table did not increase. 


Figures 4 and 5 illustrate the operation of main memory 
address generation based on the modes added in the 
$3900. 


3.3 Expanded memory unit (EMU) 

In addition to a maximum | GB main memory unit in 
the $3900, an expanded memory unit (EMU) provided 
with a maximum capacity of 8 GB and a transfer 
capability of a maximum 400 MB/s was supported. 


Since the EMU can be used to expand the capacity of 
main memory and as a high-speed file server, new data 
transfer instructions between the EMU and main 
memory were added. 


The two types of data transfer are asynchronous transfer 
performed by the IOP and synchronous transfer where the 
EPU performs transfer processing. In addition to the EMU 
address in EMU data transfer as the absolute address, 
mapping based on virtual addresses is also supported. 


Furthermore, even between multiple systems that are 
not direct:y connected through system control units, 
mutual communication between systems and sharing 
data in expanded memory are possible by connecting via 
the EMU. Consequently, instructions (one type) that 
perform communication between systems via EMUs and 
instructions (one type) for exclusive control between 
systems were added. 








This report contains information which is or may be copyrighted in a number of countries. Therefore, copying and/or 
further dissemination of the report is expressly prohibited without obtaining the permission of the copyright owner(s). 


















































































































































JPRS-JST-94-019 9 
28 July 1994 
O) “> ve mw Pw st 
2 | P3s7INS*een=" oo 
twycu ; ewsgre 3 ere saes) &—sasena sy TrE2 Berra 
Tease 
THe 22D: s7 rinse) 
, 2 e 
“a—-3L_v1¢ now _ ~— 
ry ” | Rarhex re T1 10 
i vet 
t "y 5 , 
[ t 
ewersy rn ll en! 
L : ” ‘ 12 
= ae : thsoee 
' 
Mee S12 >| scr : 
cot ' 
ror ' 
=: - 
Moe pew (S12 Ener, - — ° y 
16 et °F eX -2F-7A pack: 
SSID: GSI K— FTA Pt , \ 14 
dS = | oom 
al A -LF—IA ot. 
CRW | Ft bX —279-4 <-o" 
Plow 4 — SF — IAF OPT see 13 ere 1S 
ree ; mF — FAX —27--F @ 7, ‘a 
Plw  ~«-7F-7Aa7-+F tL 
Figure 4. Real-address generation (ID-SCT type) 


Key: 1. Page Directory, Base Register 


2. Effective Virtual Space Number 3. Section Number 4. Page Number 5. 


Relative Address in a Page 6. Virtual Address 7. Address Translation Buffer 8. When unregistered 9. Virtual Address 
10. Main Memory Page Address 11. Registration 12. When registered 13. Relative Address in a Page 14. Real Page 
15. 4kB 16. DBT: Directory Base Table, WSPTD: Virtual Space Page Table Directory, SCT: Section Table, PT: Page 
Table, DBW: Directory Base Word, PTDW: Page Table Directory Word, PBW: Page Table Base Word, PTW: Page 


Table Word 





4. Conclusion 


'n this paper, we described the architecture of ACOS 
System 3900. The $3900 maintains compatibility with 
the $2000 and is a system with improved functions to 
speed up and increase capacity in future information 
management. 

As the system supporting the core of C&C, this system is 
assured to effectively respond to the wishes of users. 
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Figure 5. Real-address generation (DD-.) VT type) 
Key: |. Effective Virtual Space Number 2. Division Table Number 3. Section Number 4. Page Number 5. Relative 
Address on a Page 6. Virtual Address 7. Address Translation Buffer 8. When unregistered 9. Virtual Address 10. Main 
Memory Page Address 11. Page Directory, Base Register 12. Registration 13. When registered 14. Real Page 15. 
Relative Address on a Page 16. 4 kB 17. WSPTD: Virtual Space Page Table Directory, DVT: Division Table, SCT: 
Section Table, PT: Page Table, PT DW: Page Table Directory Word, SBW: Section Base Word, PBW: Page Table Base 
Word, PTW: Page Table Word 





ACOS System 3900 Hardware 1. Introduction 
94FE0465C Tokyo NEC TECHNICAL JOURNAL As the central system in an advanced communications 
in Japanese Feb 94 pp 11-19 and control (C&C) information society, ACOS System 


3900 ($3900) was developed to meet the need for super- 
[Article by Yuzo Ohmori, Computers Division; H. Ishii, large-scale computers with faster speeds, more reli- 
Computers Division; M. Kofuji, NEC Engineering Ltd.; —_ ability, and more functionality. 
M. Yamamori, NEC Systems Laboratory, Inc.; T. a 
Kanazawa, NEC Kofu, Ltd.; H. Ishizaka, NEC Engi- _ By utilizing new VLSI technology adopted for the SX-3 
neering Ltd.; T. Kimura, NEC Engineering Ltd.; §. Supercomputer, the $3900 achieved twice the perfor- 
Ohno, Computers Division; T. Morisada, NEC Kofu, | ™ance compared to a single processor in the older model 
Ltd.; M. lida, NEC Engineering Ltd.; T. Shigemasa, NEC | ACOS System 2000 (S2000) through a variety of hard- 
Engineering Ltd.; I. Hasegawa, NEC Kofu, Ltd.; K. Ware innovations. Furthermore, the maximum number 


Naito, NEC Kofu, Ltd.; T. Ogura, NEC Kofu, Ltd.; and Of execution processing units (EPUs) in a tightly coupled 


E. Oi, Computers Division] multiprocessor increased from the four units of the 
[Tent] $2000 to eight units. 
ext 
In addition, reliability is achieved by dramatically 
Abstract decreasing the number of parts by using V"SI and 


improving RAS (Reliabiiity, Availability, Serviceability) 
ACOS System 3900 is the high-end model of the ACOS-6 functions. 
System. It 1s a super-large-scale general-purpose com- 
puter that adopts the latest VLSI technology and System Configuration 
advanced high-speed technology. * © 


This paper presents an overview of the processing units 
that form the core devices in ACOS System 3900, their § The $3900 ranges from Model 10 having one execution 
features, and high-speed technologies. processing unit (EPU) configuration to Model 80 having 


Figure | shows the system configuration of the $3900. 
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Figure 1. Configuration of the ACOS System 3900 


Key: |. EPU: Execution Processing Unit, SCU: System 
Control Unit, MMU: Main Memory Unit, IOP: I/O 
Processor; EMU: Expanded Memory Unit 2. Model 10 
3. Maximum configuration 





an eight-unit EPU configuration. Moving to better sys- 
tems at the installation site (field upgrade) is possible 
between these systems. Thus, the needs in system per- 
formance can be flexibly addressed. 


Also, the new expanded memory unit (EMU) can be used 
in the $3900. By adding the EMU, system performance 
improves and a high-performance loosely coupled mul- 
tiprocessor system or a high-speed switch hot standby 
system can be configured. 


recall 


3. Execution processing unit (EPU) 


Figure 2 shows the logical structure of the $3900’s 
execution processing unit (EPU). 


The EPU decodes various instructions read from the 
main memory unit (MMU) and performs arithmetic and 
logical operations, executes various controls, and per- 
forms interrupt processing. Also, by adding the optional 
integrated array processor (IAP), vector operations can 
be rapidly executed. Since EPU performance is a major 
factor in setting the performance of the system, various 
technologies to speed up the EPU are being adopted. 


The logical structure of the $3900’s EPU is fundamen- 
tally based on the logical structure of the $2000’s, but the 
logical structure is changed by expanding the architec- 
ture, adopting new faster technologies, and optimizing 
new technologies. 


Next, the major changes are described. 


3.1 Pipeline control 


The execution of a instruction consists of the series of 
steps of reading instructions, decoding instructions, cal- 
culating operand addresses, reading operands, executing 
the operands, and storing the result. Pipeline control is 
designed to independently process each of these steps. By 
connecting these stages serially and having the result of 
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Figure 2. Logical structure of the EPU 

Key: 1. Cache 2. Address Translation Mechanism 3. Instruction Cache 4. Operand Cache 5. Read Request Buffer 6. 
Store Buffer 7. Main Control Memory 8. Variable Length Operation Control Memory 9. Priority Control Unit 10. 
Instruction Buffer 11. Instruction Decoder 12. Branch Prediction Mechanism 13. Instruction Address Adder 14. 
Operand Address Adder 15. Decoded Instruction Buffer 16. Operand Buffer 17. Execution Processing Unit 18. Basic 
Execution Processor 19. Operator Buffer 20. Program Register 21. High-Speed Remainder Operator 22. Floating- 
Point Adder 23. Variable Length Data Operator 24. Vector Operator Control Mechanism 
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Key: |. | Machine Cycle 2. Instruction 3. D: Instruction 
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the previous processing stage input to the next pro- 
cessing stage, this method simultaneously processes mul- 
tiple serial instructions in different stages. 


The 8-stage pipeline control shown in Figure 3 is 
adopted in the $3900. The $3900’s pipeline needs eight 
machine cycles to process one instruction. However, 
eight instructions can be executed in parallel by having 
the pipeline operate efficiently without any gaps. The 
result of one instruction can be obtained in each machine 
cycle. 


The primary factors hindering pipeline efficiency are 
delays in reading instructions accompanying changes in 
the instruction sequence due to branch instructions, 
delays in reading operands and instructions due to cache 
misses, and contention between operand and instruction 
reads in the cache memory. Through various logical 
innovations in the $3900, the decrease in pipeline effi- 
ciency caused by the above will be minimized. 


3.2 Improving the buffers 


(1) Improving the instruction buffer 


Since the instruction prefetch circuit and the operand 
prefetch circuit in the EPU operate asynchronously, a 
buffer is necessary for data transfer between these cir- 
cuits. When the buffer capacity is small, the problem 
arises of the inability to adequately supply instructions 
to the pipeline, thus the efficiency of pipeline operation 
falls. As in the $2000, the $3900 is equipped with a total 
of four instruction buffers: one for the stream of instruc- 
tions currently being executed and three for the stream of 
branch instructions. The capacity of the instruction 
buffers in the $3900 is double those used in the $2000 so 
the supply of instructions to the pipeline will proceed 
more smoothly. 


(2) Address translation buffer (TLB) 


The $3900 utilizes a virtual storage method capable of 
increasing the apparent memory space seen from the 
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software beyond the real memory. Translation from a 
virtual address in the virtual memory space to a real 
address in the real memory space references the page 
table in the main memory. To accelerate this address 
translation, an address translation buffer (TLB) that 
records the pair of the virtual address in the EPU and its 
corresponding real address is provided. 


The TLB is dedicated to instruction and operand use as 
in the $2000. Here, the major change is the two-stage 
configuration of the primary TLB (PTLB) and the sec- 
ondary TLB (STLB). STLB is constructed in RAM and 
corresponds to a conventional TLB. The PTLB stores a 
portion of the STLB’s contents, is constructed from a 
gate circuit, and can provide data (real addresses) at high 
speeds from the RAM. 


During address translation, the real address is provided 
by the PTLB. When the real address is not in the PTLB, 
it is provided by the STLB. When the real address is not 
in the PTLB nor the STLB, the page table in main 
memory is used to perform address translation. Conse- 
quently, TLB access time is shortened and address 
translation can be rapidly performed. 


3.3 Cache memory 


Similar to the $2000, the $3900’s EPU has cache mem- 
ories that operate independently for dedicated operand 
use or instruction use. Each cache memory has a capacity 
of 64 kB and accelerates instruction processing without 
contention between instruction and operand accesses 
generated while basic instructions are being processed. 


When the required data is not in the cache memory, data 
must be sent to the cache memory from the main 
memory unit. In the $3900, the transfer unit (block) was 
expanded from the 64 bytes (8 bytes x 8 times) in the 
$2000 to 128 bytes (16 bytes x 8 times) and the cache hit 
rate was improved. 


Each cache memory is divided into four logical parts 
with 16 kB as one control unit. As a result, concurrent 
operation is possible only when reading and writing in 
the same control unit are not in contention. The write 
operation does not wait for the read operation and 
high-speed data is furnished. 


3.4 Other acceleration measures 


The $3900’s EPU adopted acceleration technologies, 
such as the ones below, by adding the techniques 
described above in order to accelerate processing. 


(1) Branch prediction mechanism 


In pipeline control, branch instructions become major 
factors in disrupting pipeline flow. Branch instructions 
increase the access frequency; consequently, accelerating 
them is crucial to improving performance. Since the 
execution time of a branch instruction depends on the 
time to supply the instruction word preceding the branch 
to the pipeline, the instruction word preceding the 
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branch is rapidly provided in the high-speed execution of 
branch instructions and supplied to the pipeline. 


To accelerate the instruction preceding a branch in the 
§$3900, a branch prediction mechanism was provided. 


(2) Control memory partition 


The execution processing unit in the EPU operates based 
on control signals output by the microprogram read from 
the control memory. Furthermore, the control memory 
unit operates while referencing operator result informa- 
tion sent from the execution processing unit. In this way, 
the execution processing unit and control memory 
operate while being closely connected to each other. 
Control memory in the $3900 is divided into two parts: 
one to control variable-length data operators and 
another to control other operators. As a result, the 
physical distance between the control memory and exe- 
cution processing unit is shortened, the signal propaga- 
tion time is decreased, and operator processing can be 
rapidly executed. 
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(3) Accelerating frequently used instructions 


In variable-length data processing instructions like the MLR 
(Move Alphanumeric Left to Right), the innovation to speed 
up processing is adding dedicated circuits for cases with 
high usage frequencies. 

(4) Using a phase delay clock 

In the $3900, in addition to the normal clock, two types of 
phase delay clocks are used. Since a number of fluctuations 
arise because of circuit configurations in the signal delay 
time in each pipeline stage, the system clock is limited by 
the largest delay time of one of the clocks and acceleration 
becomes difficult. By using the optimum clock corre- 
sponding to the delay time in the $3900, a fast system clock 
was realized. 


4. System Control Unit (SCU) 


The system control unit (SCU) is the central device in 
the system. As shown in Figure 4, it consists of a system 
interface control unit, system configuration control unit, 
and diagnostic control processor (DGP). 
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To achieve a high level of expandability and flexibility, 
the SCU adopts a configuration capable of connecting 
modular processors in arbitrary ways. There can be a 
maximum of four EPUs, four IOPs, one MMU, and two 
EMUs in one SCU. In addition, a TCMP (tightly cou- 
pled multiprocessor) system connecting two SCUs can 
be constructed. 


4.1 System interface controller 


The system interface control unit efficiently controls 
memory access requests from the EPU and IOP and 
communication requests between processors. 


Technology similar to the ultra-fast, high-density tech- 
nology used in the EPU was used and one MCP (Multi 
Chip Package) was implemented. 


Memory access uses 128-byte block data transfer and 
maximum 64-way interlerving to realize high-speed 
memory throughput. 


By adding 4-byte units to the error correcting code (ECC) 
for memory, a 4-byie write is not a partial byte write, but is 
a full byte write that can be processed in one memory cycle. 
This results in improved memory throughput. (Partial byte 
write processing merges data reads from memory and write 
data. To write to memory again, two memory cycles are 
required.) Moreover, to accelerate control for communica- 
tion requests and synchronous control between processors, a 
high-speed synchronous controller and high-speed transmis- 
sion buffer were installed in the SCU. 


4.2 System configuration controller 


The system configuration controller centrally manages 
configuration control information and implements 
system configuration control rich in adaptability. 


The controller has implementations from a single pro- 
cessor system to assorted multiprocessors with up to a 
maximum of eight processors, functions to dynamically 
modify the operating state (i.e., installing and discon- 
necting equipment, uninterrupted switching of one OS 
and two OS), automatic disconnection of faulty devices 
from the operating system when failures occur, mainte- 
nance and repair of faulty devices concurrent with 
system the operation, and re-installation functions for 
repaired devices to the system operation. 


4.3 Diagnostic control processor 


As systems become larger and faster, fast diagnostic 
functions become necessary. 


An RAS support system having the intelligence to 
improve fault processing functions and maintenance and 
diagnostic functions is required. This is implemented in 
cooperation with the service processor. 
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The diagnostic control processor and each unit (EPU, IOP, 
MMU, EMU) are connected through a dedicated special- 
ized maintenance and a diagnostic interface. Through this 
interface, retry control during failures, logging of fault 
information, and execution control for diagnostics take 
place without disrupting other operating devices. 


5. Main Memory Unit (MMU) 


In the $3900, an ultra high-speed, one megabit static 
RAM is used in the main memory unit (MMU). A fast 
access time, high throughput, and large capacity main 
memory is offered. 


Three-dimensional packaging technology is adopted and 
compact units are implemented. 
5.1 Overview of the main memory unit (MMU) 


The MMU is connected to the system control unit. One 
MMU has a maximum memory capacity of 512 MB. By 
connecting two MMUs in the TCMP system, a main 
memory with a maximum of | GB can be created. 


5.2 Configuration of the main memory unit (MMU) 
Figure 5 shows the MMU’s logical structure. 
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Figure 5. Logical structure of the MMU 
Key: 1. System Control Unit (SCU) 2. Memory Unit 
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The MMU consists of a memory unit controller (MUC) 
and a maximum of eight memory units (MUs). 


In response to processing requests from SCU, MUC 
executes writing and reading operations for the MU. 
MUC utilizes the ultra-fast CML (Current Mode Logic) 
technology. This structure can rapidly access the MUs. 


MU is a memory array for storing information. A SIM 
(Single Inline Module) card loaded with | Mb of static 
RAM is utilized as the memory element and constructed 
in a 3D package. One MU has a maximum of 64 memory 
banks. Throughput was improved by parallel operation 
based on interleaved operation. 


Four bits form one group in the MMU. The ECC 
function that corrects any multiple error bits in one 
group and detects errors over two groups is adopted, 
thus, reliability is achieved. 


6. I/O Processor (IOP) 


6.1 Overview of the I/O processor (IOP) 


The I/O processor (IOP) is connected to the system 
control unit (SCU) and performs input and output 
between the main memory unit (MMU) and peripheral 
control unit (PCU) based on instructions frora the cen- 
tral processing unit (CPU). 


$3900’s IOP achieves a large-scale system configuration, 
a maximum of 512 channels (twice that in the $2000) per 
system for requests with a high I/O throughput, and a 
maximum overall transfer capability of 2 GB/s (five 
times that of the $2000) per system. 


The table lists IOP specifications. 





Table Specifications of the IOP 
Item Specifications 














Number of connected IOPs 4 units/SCU 
§ units/system 
Number of channels 64 channels/IOP 





512 channels/system 





Transfer speed of channel 
Overall transfer capability 


9 MB/s (maximum) 








2 GB/s (maximum) 





6.2 Features of the I/O processor (IOP) 


(1) Hierarchical control mechanism. IOP consists of 
channel management unit (IOPP) to manage the channel 
conditions and communicate at high speed with the 
CPU, a channel management unit (CMU)/channel data 
component (CDC) to control data transfers through the 
channels, and a channel unit (CHU) to control commu- 
nication with the PCU. By distributing processes to each 
unit in response to function and performance, I/O oper- 
ations are efficiently controlled. 


(2) Optical channels. The optical channel (OCH) is a 
channel for data transfer through optical fiber cable. 
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Figure 6. Logical structure of the IOP 
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Therefore, optical cables are smaller and lighter weight 
than cables used in conventional electrical channel and 
can be connected over longer distances to the PCU. The 
degree of freedom in the system configuration increases 
and a large-scale system can be easily configured. 


(3) VLSI CMOS. VLSI gate arrays with 177,000 gates are 
used in the channel. And more compact, lower power 
consuming, and more reliable units were implemented. 


6.3 Structure of the I/O processor (IOP) 


The IOP must communicate at high speeds with the 
CPU and MMU, and achieve parallel data transfer with 
multiple PCUs. 


Consequently, the functions for each process are distrib- 
uted. Control is hierarchical to enable each function to 
operate efficiently. 


Figure 6 illustrates the logical structure of the IOP. 


(1) Channel Management Unit (IOPP). In addition to 
managing channel conditions, this unit centrally man- 
ages the initiation and termination of data transfers in 
subordinate channels by instructions from the CPU. 


To communicate at high speeds with the CPU, the IOPP 
is constructed from high-speed logical elements. 
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(2) Channel Management Unit (CMU). A maximum of 
32 channels are controlled in accordance with the 
channel program by instructions from the channel man- 
agement unit (IOPP). 


To efficiently control the channels, a dynamic address 
translation mechanism and prefetch mechanism for the 
channel program are built-in. 


(3) Channel Data Component (CDC). Data transfer is 
executed by the main memory unit (MMU), channel unit 
(CHU), and optical channel (OCH) based on control 
from the channel management unit. 


It has a data prefetch mechanism based on pipeline 
control and can execute data transfers with the PCUs at 
high speeds. 


(4) Channel Unit (CHU). Data is transmitted and 
received by the PCUs via electrical signals in channels 
using electrical cables in data transfers with the PCUs. 


A large capacity data buffer and high-performance 
microprocessor are installed for efficient transfer data 
with the PCUs. The microprocessor controls the inter- 
face with the PCUs. 


(5) Optical Channel (OCH). The optical fiber cable is the 
channel that acts as the transfer medium for data. Data 
are transmitted and received as optical signals. 


The optical fiber cable is compact and lightweight com- 
pared to a conventional electrical cable and is capable of 
long distance transfers. As a result, by using optical 
channels, the degrees of freedom in the system configu- 
ration are increased and large-scale system configura- 
tions become easy to build. 


The following two methods are used to connect the 
optical channel and the PCUs. 


1) Direct connection. PCUs that can be directly con- 
nected to optical channels are directly connected by 
optical fiber cables. 


2) Connection via optical mode channels. When PCUs 
that can be connected to conventional electrical channels 
are connected to optical channels, they are connected via 
the optical remote channel (RCH) routes. 


(6) Optical Remote Channel (RCH). To efficiently 
transfer data between peripheral control units in an 
adapter used when conventional PCUs are connected to 
optical channels, large-capacity data buffers and high- 
performance microprocessors are installed. 


Since the interface with a PCU is controlled by a 
microprogram, the interface with each type of PCU can 
be flexibly handled. 


7. Expanded Memory Unit (EMU) 


Expanded memory is provided as one memory device 
constructed as a hierarchical memory in the $3900. 
Expanded memory capable of high-speed, large capacity 
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Figure 7. Logical structure of the EMU 
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accesses is memory that supplements the memory 
capacity of the main memory unit (MMU) and can be 
used as memory to supplement access performance in an 
auxiliary memory device. 


7.1 Overview of the expanded memory unit (EMU) 


The expanded memory unit (EMU) uses a 4-Mb CMOS 
dynamic RAM as the memory element. A high-speed, 
large-capacity expanded memory can be constructed 
with a maximum of 4 GB per EMU and a maximum of 
8 GB per system. 


7.2 Structure of the expanded memory unit (EMU) 
Figure 7 shows the logical structure of the EMU. 


The EMU is constructed from an expanded memory unit 
(EMU) and 16 memory units (EU). A maximum of two 
EMUs can be connected per system. 


Based on memory access requests from SCU, the EMC 
performs high-speed processing in 128-block data 
transfer to perform read and write operations with the 
EU. In addition, the EMC is equipped with four SCU 
interfaces. An LCMP (lightly coupled multiprocessor) 
system that shares expanded memory between a max- 
imum of four hosts can be built and a hot standby 
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method combined EMU is adopted. To achieve reli- 
ability in the system, flexible responses are possible. 


The EU is the memory array that stores data. One EU to 
a maximum of 16 EUs can be constructed. One EU 
consists of a 256 MB memory card. 


To complete the RAS function in the logic circuit, an 
ECC function for 2-bit error detection and 1-bit error 
correction, a built-in diagnostic function, and a memory 
scan function are used. In addition, the redundant oper- 
ation in the power module and dual cooling fans are used 
to achieve high reliability. 


By adding a dedicated battery to handle unexpected 
faults like power stoppages and momentary interrup- 
tions, memory data can be backed up. The backup 
targets for the battery are limited to the EU and the 
refresh control circuit in the EMC. Low power consump- 
tion is designed and more compact batteries and longer 
backup times are produced. 


8. System Operating Station, Service Processor, and 
System Support Processor 


8.1 System operating station and service processor 


(1) Configuration and control. The system operating 
station (OPS) and service processor (SVP) are devices 
that allow the operators and support personnel to 
operate and maintain systems. It has the following 
structure. 


Operator display with keyboard 

Status display keyboard 

Floppy disk 

Compact fixed disk 

System operating station and service processor con- 

trol unit 

¢ Operating panel (system power on/off switch, activity 
monitor, etc.) 

e Emergency power off (EPO) switch 

e Maintenance communication circuit (ALIVE) con- 
nection mechanism 

e Uninterruptible power timer mechanism 


The following devices can be connected as peripherals. 


¢ Supplemental operating station 
¢ Serial printer 

¢ Remote display 

e Time synchronization unit 


The system operating station and service nrocessor con- 
troller are installed in the basic unit record processor and 
takes on the three roles of system operating station 
control unit, built-in system support processor, and 
service processor. In addition, the controller has one 
high-performance 32-bit microprocessor and one 16-bit 
microprocessor. It efficiently uses the resources of the 
system operating station and the service processor. Fur- 
thermore, high reliability is realized by using LSI for 
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error correction in the memory and having redundancy 
in the controller, devices, and power supply. 


Figure 8 illustrates the structure. 


(2) Functions. The system operating station is connected 
to devices like the I/O processor (IOP) and the system 
control unit (SCU) and has advanced intelligence to 
centrally control the whole system. System operation 
start-up control, interaction with the operator, and dis- 
plays of the system’s operating conditions take place via 
the interface with the I/O processor. Also, maintenance 
and diagnosis of core units are performed through the 
interface with the system control unit. The on/off power 
sequence, disconnection and emergency power off indi- 
cation of the system are performed by the power control 
interface. 


Each function is described below. 


1) Operating panel. This is constructed from various 
switches and display lamps, has power on/off and dis- 
connection indicators and system start-up indicators. It 
is primarily used at the beginning and end of system 
operation. 


2) Activity monitor. Since the usage rates of the execu- 
tion processing unit (EPU) and the I/O processor (IOP) 
are displayed as bar graphs on the operating panel, the 
user can easily determine the system’s operating condi- 
tions. 


3) Operator display. In addition to sending input data 
keyed in at the keyboard to the CPU, messages from the 
system are displayed on the CRT display. 


4) Status display. The job processing conditions and the 
usage status of system resources are displayed on the 
same CRT as the operator display. When the operator 
display is down, this function can be switched to. 


5) Floppy disks and compact fixed disks. Files are used 
for the control programs of system operating stations, 
service processor, and built-in system support processor, 
for testing and diagnosing the CPU, and for storing 
various log information. 


6) Serial printer. Hard copies of the display content for 
the operator display on the system operating station, 
Status display, and auxiliary operating station display 
can be obtained. The operator can use them to check the 
operating status of the system. 


7) Auxiliary operating station. This consists of a key- 
board and dispiay similar to the system operating sta- 
tion. By using this operating station, a multiple console 
system can be built. 


8) Remote display. Since the display content of each 
display can be shown unmodified, the remote display is 
used to manage the operating conditions of the system in 
locations ape t from the computer room such as a 
monitor room. 
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Figure 8. Logical structure of the OPS, SVP, and SSP 
Key: |. Basic Unit Record Processor 2. SVP Controller 3. SVP Controller 4. Maintenance Panel 5. Auxiliary Operating 
Station 6. Remote Display 7. System Operating Station 8. Operating Panel 9. Display/Keyboard Controller 10. 
Display/Keyboard Controller 11. Serial Printer 12. Remote Display 13. Remote Display 14. System Operating Station 
15. Operating Panel 16. Display/Keyboard Controller 17. Display/Keyboard Controller 18. Serial Printer 19. Remote 


Display 20. Auxiliary Operating Station 





9) Uninterruptible power timer and time synchroniza- 
tion device. A battery is installed and the year, month, 
day, hour, minutes, and seconds are sent to the calendar 
clock of the CPU at the start-up of system operation. If 
connected to the time synchronization device, the cal- 
endar clock automatically matches Japan Standard Time 
(JST). Synchronization of the calendar clocks of the 
multiple CPUs needed to build a hot standby system 
becomes possible. 


10) Service processor and maintenance communication 
circuit (ALIVE) connection mechanism. This has a 
system controller and interface and maintains and diag- 
noses the execution processing unit, main memory unit, 
and I/O processor. Its primary functions are test diag- 
nostics of the CPU, troubleshooting support using scan 
pass, error logging of the CPU, and simulating the 
maintenance panel. 


By using the maintenance communication circuit con- 
nection mechanism, these functions can be used by 


expert maintenance technicians in each specialty who 
are permanently stationed in the service center. Support 
requiring advanced technological strength can be quickly 
offered to every site. 


8.2 System support processor 


(1) Structure and control 


There are two types of system support processor (SSP), a 
built-in model and stand-alone model. The built-in 
model is used in the single/dual system and the stand- 
alone model is used when the system has a loosely 
coupled multiprocessor (LCMP) structure. 


The system support processor centrally manages the 
power source control in each device in the system. 


In addition, the automatic operation controller (AOC) 
can be connected as a supplemental mechanism. Auto- 
matic power control by supplying and disconnecting 
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system power including secondary equipment or the 
timer, and environmental monitoring of the machine 
room can be implemented. 


(2) Functions 


1) Power control. Supplying or disconnecting 
power, monitoring the power conditions, displaying 
abnormal power conditions, and collecting error logs of 
abnormal equipment for the whole computer system or 
for each device can be implemented. The voltage margin 
of each unit can be set and faulty equipment can be 
discovered early. 


2) Maintenance communication circuit (ALIVE) 
connection mechanism. In systems having multiple ser- 
vice processors, remote maintenance of the whole system 
can be implemented in one circuit connected to the 
maintenance communication circuit from the stand- 
alone system support processor. 


9. Conclusion 


We presented a brief introduction centering on technol- 
ogies for accelerating hardware in ACOS System 3900. 


The demand for high performance and advance func- 
tionality for super-large-scale computers will increase as 
the multifaceted advanced information society develop. 


In the future, we will work to respond to user needs. 


RAS Technologies for the ACOS System 3900 


94FE0465D Tokyo NEC TECHNICAL JOURNAL 
in Japanese Feb 94 pp 20-25 


[Article by Nobutaka Fujiwara, Computer Division; K. 
Shinohara, NEC Software Hokuriku, Ltd.; T. Shigeta, 
NEC Software Hokuriku, Ltd.; S. Nasu, NEC Software 
Hokuriku, Ltd.; H. Gotoh, NEC Software Hokuriku, 
Ltd.; S. Koizumi, NEC Software Hokuriku, Ltd.; T. 
Ueda, NEC Software Hokuriku, Ltd.; Y. Taya, NEC 
Software Hokuriku, Ltd.; and T. Hashiguchi, Computer 
Divisicn] 

[Text] To cope with social needs regarding highly reliable 
computers, in addition to existing technologies, ACOS 
System 3900 employs the latest technologies extending 
from the circuit level to the system level. 


This paper describes the RAS technologies of ACOS 
System 3900, including the technologies of error check- 
out at failures, automatic recovery due to error correc- 
tion and retrial, reconfiguration to disconnect failed 
hardware, and maintenance diagnostics for rapid failure 
recovery and a fault tolerant system. 


1. Introduction 


The essential functions and performance possessed by a 
computer system must be satisfactorily demonstrated, 
moreover, the system must be robust in terms of main- 
tenance when confronted with failures. In other words, it 
is crucial to have few failures, to maintain system 


19 


operation by appropriate failure processing in the rare 
event of a fault, and to design for the rapid repair of 
failures and the rapid recovery of the system. The 
concept of RAS encompasses these high reliability tech- 
nologies and is the acronym for Reliability, Availability, 
and Serviceability. 


Figure | shows the failure recovery procedure of RAS 
technologies in ACOS System 3900 (S3900). 


First of all, RAS technologies are based on equipment 
and systems that do not easily break down. The $3900 
employs the most advanced technologies in the world of 
high-speed LSI, high-speed RAMs, and large-scale high 
density packages for LSI. Together with implementing 
highly integrated parts, a major reduction in the number 
of parts used and a reduction in wiring between parts are 
designed. In addition, by adopting direct liquid cooling 
method VLSI that efficiently cools high density mounted 
devices and a redundant structure in the power source 
moduie, high reliability is achieved. 


Since high reliability is being realized at the parts level, 
if by chance a failure occurs, sufficient measures must be 
considered for each device and the total system. There- 
fore, the detection of errors generated by faults is neces- 
sary. Next, automatic recovery is performed by 
correcting the detected errors and retrial. When this 
automatic recovery does not end successfully, the failed 
part is disconnected and replacement equipment is 
switched to, then the system is retried. Although func- 
tions and performance partially degrade, the effect of 
failures is minimized and system operation proceeds. 
Thus, high fault tolerance is pursued. 


Based on superior maintenance and diagnostics tech- 
nology represented by built-in diagnostics (BID), failed 
parts can be correctly identified and rapidly repaired. 


Improving operability by one-dimensional maintenance 
tools from the service processor (SVP), implementing 
coordinated maintenance through remote maintenance, 
and improving maintenance by adopting powerful data 
collection methods are planned. 


RAS technologies fostered in the conventional ACOS 
system and the SX supercomputer are further developed 
in the $3900. 


Along with introducing each technology of error detec- 
tion, automatic recovery, reconfiguration, and mainte- 
nance diagnostics, the $3900’s fault tolerant system 
structure is introduced as an example response to the 
demand for higher reliability. 


2. Error Detection 


As error detection methods for failure generation in the 
$3900, in addition to parity checks, parity prediction 
checks, and 1 out of N checks, a method to match the 
outputs from duplicate circuit block levels is adopted 
and reliable error detection is performed. These detec- 
tion circuits are positioned anywhere in the equipment 
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Key: 1. Preventive maintenance 2. Normal operation 3. Degraded operation 4. Fault location diagnostic in 

disconnected device 5. Repair of disconnected device 6. Installation of disconnected device 7. Success 8. Fault 
generation 9. Fault detection 10. Automatic correction 11. Failed 12. Automatic retry 13. Disconnection of fault 
location 14. Switch to replacement device 15. Not possible 16. Automatic reconfiguration 17. Fault location diagnostic 
18. Fault location repair 19. Restart after repair 20. R: Hard to fail 21. A: When fault occurred, failure processing 
performed v + -~icrcaes continues. 22. S: Fault location identified, rapid repair and maintenance performed. 
23. Change of status 





that makes up the system in order to optimize the 
various circuits in the equipment. 

To simplify the explanation of the error correction, it is 
important to prevent error propagation and to localize 
errors spatially and temporally in order to increase the 
retry success rate. In addition, through localization, the 
detachment unit during reconfiguration becomes small 
and the effect on the system is reduced. 


Therefore, the result of error detection is expressed by 
the error indicator flip-flop (EIF) and becomes the target 
of error logging. 


3. Automatic Recovery 


In a broad sense, automatic recovery from errors 
includes reconfiguration to be explained later, but here it 
indicates that the error itself is automatically recovered 
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from and is not accompanied by degradation of the 
functions or performance. Automatic recovery is classi- 
fied into correction technology and retry technology that 
repeats the same actions. Here, the automatic recovery 
technology implemented in the $3900 is introduced. 


3.1 Error correction 


The control memory for the execution processing unit 
and I/O processors can have 1-bit error correction and at 
least two bits for error detection (SEC/DED: Single bit 
Error Correcting/Double bit Error Detecting). Addition- 
ally, in the main memory unit, the checking method is 
implemented where the automatic corrections of neigh- 
boring 4-bit errors (1 block error) is performed and dual 
neighboring 4-bit errors (2 block error) are detected 
(S4EC/D4ED.: Single 4-bit-byte Error Correcting/Double 
4-bit-byte Error Detecting). 


3.2 Error retry 


Faults are broadly classified into permanent faults and 
intermittent faults. In recent years, as technology has 
become faster and more highly integrated, the per- 
centage of intermittent faults has increased. If a fault is 
momentary due to noise (intermittent fault), by 
repeating the affected operations, the error can be elim- 
inated and processing can continue. 


Also, as LSI becomes more integrated, soft errors in 
RAM due in particular to a-rays can no longer be 
ignored. ECC correction of 1-bit errors, rewriting of 
error words, and retrying instructions are performed to 
correct 1-bit errors in RAM. As one link measures 
against RAM soft errors, methods to remedy frequent 
errors in the equipment are divided into retries in the 
same equipment, degradation in faulty parts of RAM at 
faults in the same place in the same equipment, and 
retrying instructions in replacement equipment (proces- 
sor relief) for the third error in eight hours. For faults 
outside the RAM, processor relief takes place the first 
time. 


For errors related to I/O equipment, firmware installed 
in peripheral equipment retries at the command level. 
When the retrial fails, the operating system (OS) retries 
I/O instructions. When both fail, the OS switches the I/O 
path, exchanges to replacement equipment to avoid the 
error and a retry is executed. 


Consequently, in the rare event of a fault, when the retry 
is a success, the error has almost no effect and processing 
continues. High operability is maintained. 


4. Reconfiguration 


Similar to the conventional ACOS system, by replicating 
modules with the same structures in the $3900, redun- 
dancy is maintained. When automatic recovery is not 
possible, the fault location is disconnected and system 
operation continues in the degraded form. 
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See “ACOS System 3900 Hardware” in this issue for 
information on the system structure of the $3900. 


4.1 Reconfiguration at the circuit level 


Reconfiguration at the circuit level in each device is 
possible in the execution processing unit and main 
memory unit. There is a separate 64-kB cache memory 
for instruction use and for operand use in the execution 
processing unit. These cache memories are composed of 
four levels. Degraded operation is possible in each level. 
Consequently, even when processing efficiency falls, 
calculations and control can proceed. 


The main memory unit is composed of a maximum of 
eight memory units (MU) per device where an MU has a 
capacity of 64-MB. Memory can be reconfigured in these 
MU units. Furthermore, reconfiguration in page units is 
possible in the $3900. When permanent errors that 
cannot be corrected were detected in read out requests 
for memory, the page (4 kB) containing the corre- 
sponding physical address is disconnected. Therefore, 
operation degradation in MU units or page units 
becomes possible. 


Failures in the channels or peripheral control devices can 
degrade the paths. 


4.2 Reconfiguration at the device level 


System degradation in devices with redundant structures 
basically detaches faulty device units to improve system 
operability. In addition, when faults were detected at 
system start-up, degraded processing is performed and 
only normal hardware transfers to the OS and OS 
start-up proceeds normally. 


Processor relief accompanies the disconnection of the 
execution processing unit as reconfiguration at the 
device level. An instruction retry as a fault tolerant 
technology is effective for intermittent faults, but it is 
not effective for permanent faults. In contrast, processor 
relief is an effective measure as the fault-tolerant tech- 
nology during permanent faults in execution processing 
units. 


When faults that enable instruction retries are generated, 
the $3900’s processor relief does not retry instructions in 
the failed execution processing unit, but transfers the 
status of the failed device to another normal execution 
processing unit and processing continues in that unit. 
This addresses the case where instruction retries fail in 
the failed execution processing unit and enters a state 
that cannot retry instructions. 


When a permanent fault occurs in the I/O path of an I/O 
processor or peripheral device, system processing con- 
tinues by switching to a replacement path and the fault 
tolerance of I/O processing improves. 


Even in a single SCU system in the $3900, dual SVPs 
and fault tolerance of the maintenance function are also 
considered. Moreover, the power source is divided. 
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Through the addition of a redundant power source by 
this division, reliability in the power source is achieved. 


Although the fault tolerance technology introduced 
above is provided and system operability improves, in 
the rare event the system goes down, the RAS control 
software (TOMAS, to be explained later) in the $3900 
disconnects failed hardware and automatic reconfigura- 
tion processing that restarts the OS is provided. A 
reduction in the restart time is devised. 


5. Maintenance Diagnostics 


When faults are generated and devices or the system 
cannot be used, rapid repair to restore the normal 
operation of the system is required. Therefore, only the 
faulty device is disconnected and degraded operation 
occurs. Maintenance is performed concurrently with OS 
operation. Further, by using a one-dimensional mainte- 
nance tool from the SVP, improved maintainability is 
planned. Figure 2 shows the maintenance system. 


By applying remote maintenance in response to need, 
support is determined by the maintenance technician at 
the remote service center and comprehensive mainte- 
nance is performed. 


5.1 RAS processor 


In the $3900, the RAS processor function is imple- 
mented in the diagnostic control processor (DGP) and 
the SVP in the SCU. Also, the RAS function is imple- 
mented by the RAS processor, each device and the OS. 
Since the RAS processor operates independently of the 
OS, normal tasks are not disrupted. 


Based on the control by TOMAS (Total On-line Main- 
tenance And Service system), software in the DGP, 
reconfiguration, error logging, and test diagnostics are 
executed. During the execution of the test diagnostics, 
TOMAS disconnects the failed device from the OS and 
creates an environment where the failed device can be 
accessed independent of the OS. In this environment, the 
TOLTS (Total On-Line Test and Diagnostic System) test 
diagnostics program runs controlled by TOMAS or the 
SVP. Since the test diagnostic program operates even 
while the OS is operating (of course, even when the OS is 
down), it is called on-line test diagnostics. The SVP 
function is installed at the operating station. The SVP 
passes through the DGP and is connected at the dedi- 
cated diagnostic interface that is not used during normal 
processing to each device (EPU, MMU, IOP, SCU, 
EMU) that makes up the system (Figure 3). By control- 
ling the scan paths where registers and flip-flops are 
connected into a shift register configuration, reading out 
internal data about the hardware of the system's devices 
and setting and resetting are performed. Thus, each 
device in this system collects detailed data and performs 
diagnosis. 


5.2 Error log collection 


Of course, there is an error log of system devices in the 
$3900. Power fault errors are collected by the system 
support processor. Then based on the SVP, one- 
dimensional management is performed. Additionally, 
the hardware address in the device is provided and 
operating history data in the device until the fault 
occurred is collected, so hardware operation can be 
traced in detail. Since a CLU (CooLing Unit) fault may 
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3. RAS 


and diagnostic interface 


Figure 
Key: 1. Another DGP 2. Another SCU 3. Another SVP 4. RAS Processor 5. Service Center 6. Diagnostic Interface 7. 
Normal Interface 8. Execution Processing Unit 9. System Control Unit 10. Main Memory Unit 11. Expanded Memory 
Unit 12. Input/Output Processor 13. Diagnostic Control Processor 14. Service Processor 15. System Support 


Processor 





result in serious problems in the system, there is a 
function that collects the water level and temperature 
data of cooling water in CLU in the SVP. CLU data 
collected in the SVP can be edited and output in 
response to need. 


5.3 Diagnostics 

To restore a failed device, the fault location must be 
identified. As computers have become faster the tech- 
nologies used have become faster and more highly inte- 
grated. Consequently, the generated faults have low 
reproducibility, in other words, intermittent faults are 
becoming prevalent. In contrast to fault detection, there 
are unsatisfactory aspects to methods that execute a 
diagnostic program again. That is, initially, data at the 
time an error was detected (i.e., error log data) is used 
and diagnosis must be performed. Therefore, based on 
the BID diagnostic method that immediately and auto- 
matically identifies the fault location in the $3900, an 
effective method is adopted for permanent faults as well 
as intermittent faults. In identifying suspected faulty 
parts, the suspicion probability is matched and displayed 
and fault repair can be rapid. Additionally, faulty LSIs 
are also identified and the correct repairs take place at 
the LSI level. Furthermore, identifying fault locations 
even in the power source log is automatic. 


5.4 Concurrent maintenance and prediction maintenance 


When a system becomes large, the possibility for 
degraded operation increases where only the faulty 


device is dc.ached from the system and operation pro- 
ceeds. In this state, in parallel with the task operations, 
test diagnosis is possible for the device that failed during 
system operation so that the failed device can be 
repaired. In order to exchange failed parts, power source 
control is required, but a unit for power source control is 
set in each device in the $3900. Therefore, replacing 
failed parts becomes possible while the system operates. 
Verification after replacing failed parts is performed by 
the TOLTS test and a diagnostic program capable of 
operating in parallel with the tasks. After the repair, the 
device verified to be normal is included in the system 
and the system operates normally. 


Before the devices and system become faulty, prediction 
and planned maintenance that switches normal parts in 
advance are important to improve the system’s opera- 
bility. As a result, a margin test function is provided to 
test the limit conditions of devices and to discover the 
symptoms of faults. 


5.5 Remote maintenance 


The service center that has a wealth of data related to 
remedies for various faults can be connected to the 
$3900 via public circuits and the ALIVE (an on-line 
versatile maintenance system) remote maintenance can 
be operated from the service center. Therefore, high- 
level, rapid maintenance is possible by a specialist and 
the MTTR (Mean Time To Repair) is reduced. The 
$3900 has an automatic notification function that auto- 
matically notifies the service center in response to the 
level of the fault or failure and automatically sends 
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needed failure data to the service center when a fault or 
failure occurs in the system. In the ACOS system, a 
maintenance database is constructed for maintenance 
parts stock management to ensure the rapid supply of 
maintenance parts and failure data management to 
search failure data. 


These functions are generally referred to as the total 
maintenance service TRUST (Totalized Remote Main- 
tenance and User Support Tools) and offer coordinated 
remote maintenance service. 


6. Fault-Tolerant System 


Faster and more complex computers are coming into use. 
Moreover, our societal activities are deeply entwined with 
computers. Consequently, the effect on society when the 
service of a computer system stops is incalculable. Given 
this situation, reliability is demanded as in the past. In 
other words, the desire is to construct no downtime, 
non-stop-oriented computer systems. 


In the field of general-purpose, large-scale computers, to 
respond to these needs, many users have adopted the hot 


standby system based on loosely coupled multiprocessors. 


Even the $3900 offers a hot standby system. The devel- 
cpment model of the hot standby system does not 
immediately stop the system in use and switch, even 
when a failure occurs in the current on-line system. The 
system currently in use continues to operate and the 
chance of abnormal termination of the on-line system 
currently in use can be used to switch to a standby 
on-line system. As a result, the hot standby system 
structures become the mutual standby model and the 
load distributed model. A variety of fault-tolerant sys- 
tems can be constructed in response to user’s needs. 


Figure 4 shows the configuration of the $3900’s fault- 
tolerant system. 


7. Conclusion 


Above we described the RAS technologies of the ACOS 
System 3900. 


As the operation of computers becomes faster and more 
complex, better reliability is demanded of computer 
systems. Therefore, RAS technologies are cultivated in 
the conventional ACOS system, supercomputers are 
being developed and higher reliability is achieved. 


Moreover, by building the mutual standby and load 
distributed hot standby system, which was developed 
from the hot standby system based on a loosely coupled 
multiprocessor system to implement a reliable no down- 
time-oriented system, a fault-tolerant system can be 
offered that meets user’s needs. 


The $3900 is assured to be a large-scale system for the 
future that responds well to the user’s desire for a high 
level of reliability. Still we would like to offer a system 
that more accurately reflects the desires of users. 
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Abstract 


In step with the evolution of large, on-line real-time 
systems and Strategic Information Systems, the role of 
mainframes, which are capable of storing, exchanging, 
and processing enormous amounts of data with high 
performance and high reliability, remains important in 
the distributed processing environment. 


The ACOS-6/NVX operating system makes it possible to 
utilize the capability and performance of the large main- 
frame computer ACOS System 3900 so as to build very 
advanced information systems. 


1. Introduction 


The ACOS-6 system has been highly acclaimed as a 
super-large-scale system since its announcement. As well 
as using high performance hardware, and the main 
specifications of multiprocessor, main memory, and 
virtual memory in wie hardware architecture, the ACOS- 
6, ACOS-6/MVX, and ACOS-6/MVXII operating sys- 
tems effectively use this hardware architecture. 
Multidimensional processing from on-line systems like 
TSS-AF and TDS-AF to batch processing is efficiently 
executed. Additionally, high reliability and high opera- 
bility that are indispensable in a super-large-scale main- 
frame are offered. 


Nonetheless, trends in recent information systems are 
not limited to the rapid expansion of data capacity, but 
distributed and standardized systems have accompanied 
the evolution of network technology and the prolifera- 
tion of workstations and personal computers. 


The form of traditional computer processing builds 
user-specific task programs on the functions and inter- 
faces specific to the computer model. The system scale 
and the data handled were localized and centralized. 


Within this processing form, fields demanding high 
performance and high reliability such as banking systems 
and production management systems, as well as applica- 
tions that rely on computers requiring high-capacity data 
storage and complex data processing ability of informa- 
tion analysis for strategic support in corporations, are 
developing. Traditionally, the construction of large-scale 
databases and accelerating data processing in computer 
systems required high reliability. 

In contrast, changes in the business conditions supported 
by mainframes have accompanied the spread of low cost, 
high-performance workstations and personal computers. 
Business-specific data processing such as information 
systems tasks built on traditional mainframes are 
adopting distributed processing built on workstations or 
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personal computers. Mainframes present in this type of 
distributed environment assume the roles of large-scale, 
high-performance database servers and high-capacity 
data storage. 


A major trend is the advance of open systems for 
connecting the workstations and personal computers of 
different manufacturers and compatibility with multiv- 
endor systems. In mainframes, strengthening connec- 
tivity and cooperative processing with these open 
systems are sought. 


ACOS-6/NVX was developed as the latest operating 
system supporting the super-large-scale mainframe 
ACOS System 3900 that must meet these market needs. 
The first version R1.0 was shipped in March 1992 and 
R2.0, in March 1993. We introduce their features in this 


paper. 


2. Overview of ACOS-6/NVX 


ACOS-6/NVX is developed based on the following six 
concepts (Figure 1). 


(1) Large scale and high performance. Based on the 
ACOS extended system architecture XSA (extended 
system architecture) that achieved high-speed processing 
of large quantities of data, specifications and a perfor- 
mance range are implemented which are suited to large- 
scale data servers of large-scale on-line transaction 
system distributed processing systems as the foundation 
of the 1990s information systems. The practical technol- 
ogies of the XSA architecture are explained later. 


1. Tightly coupled multiprocessor system based on a 
maximum of eight execution processing units (EPU) 


2. Main memory unit with a maximum capacity of 1 GB 
(gigabyte) 


3. Expanded memory unit (EMU) having a maximum 
capacity of 8 GB capable of using as an ultra high-speed 
file device or I/O buffer expansion region 


4, Virtual memory space of a maximum of 2°? bytes (4 
petabytes) 

5. Large capacity/high-performance peripheral equip- 
ment like 49-GB magnetic disks and 4-GB electronic 
disks and file space on the order of 2°°-bytes (terabytes) 
(2) High reliability 


The following highly reliable technologies are offered to 
form the basis for building a nonstop system. 


1. Host standby system configuration (MSRF) 


2. Disk dual function by software (compatible with 
LCMP) 


3. Advanced RAS functions (i.e., processor relief) for the 
main system and peripheral devices 


(3) Relational database 
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Figure 1. Concepts of the ACOS-6/NVX 
Key: |. Large-Scale and High Performance 2. ACOS extended architecture XSA 3. High-performance on-line 
transaction processing 4. Flexibility and extensive expansibility 5. High Reliability 6. Nonstop oriented 7. Data saving 
measures 8. Open Systems 9. International standard compliance 10. UNIX connectivity 11. Multivendor systems 12. 
Relational Database 13. Compatible with core database 14. International standard compliance 15. High Operability 
16. Efficient and automated storage management |7. Automated and labor saving operation 18. High Productivity 19. 
Improved productivity of system development 20. High productivity language and development support system 





RIQSII V2, the relational database forming the core 
database in the large-scale system, is offered. It has the 
following features. 


1. Major extensions of database specifications 


2. Various storage modes and fast access methods pos- 
sible in optimum database design corresponding to data 
characteristics 


3. High-speed engine for the international standard 
database language SQL 


4. End-user language (TQFII) and database server func- 
tion (RIQSII server) to support end-user computing 


(4) Open systems 


To be able to build any distributed computing environ- 
ment that overcomes the barriers between manufac- 
turers and different models, an international and com- 
mercial standard communication protocols are 
supported. Its features are as follows. 


1. Cooperative processing with the UNIX' system that is 
spreading rapidly is given serious consideration. The 
virtual terminal function (TELNET) and the file transfer 
function (FTP) of the TCP/IP protocol, a job transfer 
function (NQS?), a file server function, and a print 
server function are offered (Figure 2). 


2. The file transfer access function (FTAM) and the 
electronic mail function (MOTIS/MHS) in the interna- 
tional OSI protocol standard are offered. 


3. The job transfer functions (RJE/S3770, NJP/S377011) 
in the SNA? protocol, the SNA3270 terminal support 
function of TDS-AF, and the SNA3270 support function 
of TSS-AF (ETOS/SNA3270 gateway function) are 
offered. 


(5) High operability 


To decrease the operating costs that increase as the 
system matures, operation automation and labor-saving 
functions are offered. These have the following features. 
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Figure 2. Cooperation With the UNIX System 
Key: |. High-Capacity Storage 2. High-Speed Printer 3. 
ACOS-6/NVX UNIX Cooperative Processing 4. High- 
Speed Optical LAN FDDI (C&C-NET LOOP 6780) 5. 
UNIX Workstation 6. UNIX Machine 7. Virtual ter- 
minal: TELNET client/server 8. File transfer: FTP cli- 
ent/server 9. Job transfer: NQS client/server 10. File 
server: UX file server 11. Print server: UX print server 





1. Unified Storage Management (USM) to efficiently 
and automatically manage storage in large-capacity, 
diverse file devices 


2. Cartridge library device to automate the media 
mounting operation of cartridge magnetic tape 


3. Automatic Operation Function (AOS) corresponding 
to distributed systems 


(6) High productivity 


A software development support system is offered to 
design higher productivity into the whole software life 
cycle for expanding application program development. 
This has the following features. 


1. Development methodology (STEPSII) supporting 
consistency in the whole process 


2. Unified CASE (CASEWORLD) provides an inte- 
grated support environment encompassing analysis and 
design to automatic program generation, tests, opera- 
tion, and maintenance, and document environment in 
cooperation with functions in the EWS4800 Series 
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3. XSA Architecture Application 


For the ACOS-6/NVX to be a large-scale, high- 
performance database server and to improve on-line and 
batch processing power, the XSA architecture is 
employed and various accelerating functions are offered. 


(1) Compatible with extended address architecture 
ACOS-6/NVX is compatible with the XV (extended 
virtual address) mode that has the 52-bit (4 petabyte) 
addressing capacity which is a feature of the XSA 
architecture. 


(2) Extension of hardware specifications System 3900 
can be connected to a maximum of eight execution 
processing units (EPU), eight I/O processors (IOP), and 
512 I/O channels. 


Also, up to 1,536 magnetic disks/electronic disks (2 
terabytes) can be connected as file storage devices. 


(3) Support of expanded memory units (EMU) 


In System 3900, in addition to the main memory unit 
with its maximum capacity of 1 GB, an expanded 
memory unit having a maximum capacity of 8 GB and 
transfer capability of 490 MB/s is utilized. 


In the hierarchical memory management in ACOS- 
6/NVX, an EMU is installed between the main memory 
unit and an electronic disk unit or a disk cache (Figure 


3). 


As explained below in (4), from the perspective of 
memory capacity, the EMU has the effect of expanding 
the effective capacity of the main memory unit, and 
from the perspective of transfer capacity, improves the 
effective data transfer performance of file storage 
devices. The EMU’s performance gap with the pro- 
cessing power and I/O transfer capability of the high- 
performance EPU is eliminated and the EPU’s pro- 
cessing capability can be effectively used (Figure 4). *'V 


(4) Acceleration functions offered by ACOS-6/NVX 


ACOS-6/NVX uses extended addressing and the EMU 
described earlier. By replacing the I/O processing that 
becomes the bottleneck in system performance with high 
capacity buffer memory and high-speed file storage, 
functions to accelerate various data processes are 
offered. 


These functions make possible file I/O performance 
several ten to several hundred times faster than systems 
using conventional magnetic disks and electronic disks. 


1) Expanded data management 


Expanded data management uses the EMU and pos- 
sesses the following function elements as the basic func- 
tions to improve processing performance related to file 
access (Figure 5). 


(a) EMU file 
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Figure 3. Unified Storage Management/Hierarchical Management (USM/HM) 
Key: i. Cache 2. Main Memory 3. Expanded Memory 4. Memory Space 5. Hierarchical Memory Management 6. 
Electronic Disk 7. 1 GB/unit 8. Magnetic Disk 9. 49 GB/unit 10. Cartridge Library 11. 1,250 GB/unit 12. File Space 
13. Data Memory 14. Erasing the CPU and I/O performance gap by the expanded memory unit 15. Maximum 8 GB, 
Transfer speed of 400 MB/s 16. Improving system performance by speeding up I/O 17. High-speed peripheral devices 


and hierarchical memory management 
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Figure 4. Utilization of the EMU 

Key: 1. Main Memory Unit 2. Effective Capcity Expan- 
sion 3. Expanded Memory Unit (EMU) 4. Real page 
utilization: Shared buffer control (BCIII) 5. External 
page utilization: EMU paging 6. File storage device 
utilization: EMU file 7. Improved Effective Transfer 
Performance 8. Auxiliary Memory Unit (Magnetic Disk/ 
Electronic Disk) 





In the ACOS-6/NVX, EMU can be used as a file storage 
device similar to a conventional magnetic disk and 
electronic disk. A file created in the EMU is called an 
EMU file. An EMU file can be used as a temporary file 
or a permanent file. Furthermore, when a permanent file 
is stored, to ensure the maintainability of data, a battery 
backup mechanism is supported in the EMU. 


A user can create an EMU file as a magnetic disk file. 
Without changing the standard I/O interface from the 
application program, the EMU is only selected as the file 
unit in the external parameter of JCL. The effect of the 
high performance possessed by the EMU can be used. 


In data transfer between EMU and the main memory 
buffer, ACOS-6/NVX dynamically selects synchrenous 
transier (direct transfer method) by the EPU when 
requests contain a small quantity of data and asynchro- 
nous transfer (indirect transfer method), by the IOP 
when requests containing a large quantity of data. By 
dynamically selecting this transfer method, a suitable 
load balance of the EPU and IOP is maintained and high 
sysiem throughput is obtained. 


(b) BCIII 


BC (Buffer Control) is the buffer control mechanism of 
the file access mechanism UFAS (Unified File Access 
System) offered by ACOS-6/NVX and provides simulta- 
neous access control of databases and a file library 
control mechanism. 


In addition to the conventional 1-GB main memory 
buffer, the expanded mechanism BCIII uses the EMU 
buffer with its maximum capacity of 8 GB. The page 
where the buffer in main memory overflowed is saved in 
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Figure 5. Extended data management 

Key: |. Batch Job 2. Existing I/O Interface 3. Expanded 
Data Management 4. Implementation of high-speed data 
access using large-scale memory and EMU S. Utilization 
in large-scale database buffer space, sorting work space, 
high-speed system file, and work file 6. 4-PB (petabyte) 
Architecture 7. High-Speed Sorting Work 8. EMU File 9. 
Storage Space 10. Physical I/O Reduction 11. FMS File 
12. Database Data Space, Large Capacity File Space 
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the EMU buffer. For database access, by expanding the 
cache effect into two stages and substantially increasing the 
buffer bit rate, I/O operations to the file storage device are 
eliminated and database access performance is rapidly 
improved in on-line and batch processing. Thus to store in 
the EMU buffer space, the XV mode architecture described 
earlier is used. 


(c) High-speeu sorting 

In sorting large quantities of data, in addition to the sorting 
work space in main memory, by maintaining the sorting 
work file as an EMU file, large quantities of data are sorted 
at high speeds. 

2) High-speed OLTP processing 


In on-line transaction processing (OLTP), the main pro- 
cessing state in systems having database servers, advanced 
performance is demanded that is capable of real-time refer- 
ence and update transactions for large quantities of data 
input from on-line terminals. ACOS-6/NVX applies the 
data access acceleration technologies described above to 
various processes related to TDS-AF (OLTP system of 
ACOS-6/NVX) (Figure 6). 


The relational database (RIQSII V2) can be loaded as the 
on-line database. To speed up database access, BCIII is 
installed between main memory and the EMU. The index 
unit and data unit that can be accessed at high frequencies 
are under development. 


Work files related to transaction processing and transac- 
tion journal files can be installed as EMU files. 


Furthermore, by using the EMU paging function in which 
the virtual memory paging unit (run unit and backing store) 
that stores programs and data is allocated to the EMU, 
program loading times can be greatly reduced. 


By putting these functions to practical use, OLTP with 
better performance is achieved and the users’ real-time 
demands can be addressed. 
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Figure 6. High-performance OLTP processing . . 
Key: 1. Existing I/O Interface 2. Memory 3. Buffer 4. BCIII Buffer Space (Memory) 5. EMU File 6. EMU Paging 7. 
High-Speed Journal 8. Buffer 9. BCIII Buffer Space (EMU) 10. RIQS Data Space 11. Resident 12. Buffer bit rate 


increased by high capacity buffer 
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4. Conclusion 


We introduced the ACOS-6/NVX that supports the XSA 
architecture of ACOS System 3900. Within the trends of 
larger quantities of data, faster data processing, and 
higher reliability in information processing, mainframes 
will continue to occupy an important position in the 
future. ACOS-6/NVX must respond to these demands. 
The current situation is to perfect functions centering on 
the six concepts outlined in section 2. 
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Footnotes 


1. The UNIX Operating System is developed and 
licensed by UNIX System Laboratories, Inc. 


2. NQS is the Network Queuing System developed by 
Sterling Software for NASA Ames Research Center. 


3. SNA is a trademark of IBM Corp. IBM is a registered 
trademark of International Business Machines Corpora- 
tion. 
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